□ 1: NM_008489 Related Sequences, Protein, PubMed, Taxonomy, UniSTS

Mus musculus lipopolysaccharide binding protein (Lbp), mRNA 
gi|6678669|ref|NM_008489.1|[6678669]

□ 2: X99347 ProbeSet, Related Sequences, Protein, PubMed, Taxonomy, UniSTS 

M.musculus mRNA for LPS-binding protein 
gi|1430866|emb|X99347.1|MMLPS[1430866] 

□ 3: U43202 Related Sequences, Protein, PubMed, Taxonomy 

RFP provirus of Murine leukemia virus gag precursor protein, pol precursor 
protein, env precursor protein genes, complete cds 
gi|13940446|gb|U43202.1|MLU43202[13940446]
□ 1: NM_008489. Mus musculus lipo. [gi:6678669]
Related Sequences, Protein, PubMed, Taxonomy, UniSTS, LinkOut

LOCUS       NM_008489    1925 bp    mRNA    linear   ROD 07-JAN-2002
DEFINITION  Mus musculus lipopolysaccharide binding protein (Lbp), mRNA.
ACCESSION   NM_008489
VERSION     NM_008489.1  GI:6678669
KEYWORDS    .
SOURCE      house mouse.
  ORGANISM  Mus musculus
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Rodentia; Sciurognathi; Muridae; Murinae; Mus.
REFERENCE   1  (bases 1 to 1925)
  AUTHORS   Lengacher,S., Jongeneel,C.V., Le Roy,D., Lee,J.D., Kravchenko,V.,
            Ulevitch,R.J., Glauser,M.P. and Heumann,D.
  TITLE     Reactivity of murine and human recombinant LPS-binding protein
            (LBP) within LPS and gram negative bacteria
  JOURNAL   J. Inflamm. 47 (4), 165-172 (1995)
  MEDLINE   97289150
   PUBMED   9144073
REFERENCE   2  (bases 1 to 1925)
  AUTHORS   Lengacher,S., Reed,D., Heumann,D. and Jongeneel,C.V.
  TITLE     Genomic organization and chromosomal localization of the mouse
            lipopolysaccharide binding protein gene
  JOURNAL   Immunogenetics 49 (6), 553-556 (1999)
  MEDLINE   99309821
   PUBMED   10380702
REFERENCE   3  (bases 1 to 1925)
  AUTHORS   Heinrich,J.M., Bernheiden,M., Minigo,G., Yang,K.K., Schutt,C.,
            Mannel,D.N. and Jack,R.S.
  TITLE     The essential role of lipopolysaccharide-binding protein in
            protection of mice against a peritoneal Salmonella infection
            involves the rapid induction of an inflammatory response
  JOURNAL   J. Immunol. 167 (3), 1624-1628 (2001)
  MEDLINE   21359554
   PUBMED   11466385
COMMENT     PROVISIONAL REFSEQ: This record has not yet been subject to final
            NCBI review. The reference sequence was derived from X99347.1.
FEATURES             Location/Qualifiers
     source          1..1925
                     /organism="Mus musculus"
                     /strain="BALB/c"
                     /db_xref="taxon:10090"
                     /chromosome="2"
                     /map="2 83.0 cM"
                     /cell_type="hepatocyte"
                     /dev_stage="adult"
     gene            1..1925
                     /gene="Lbp"
                     /note="Ly88"
                     /db_xref="LocusID:16803"
                     /db_xref="MGD:1098776"
     CDS             128..1573
                     /gene="Lbp"
                     /codon_start=1
                     /db_xref="LocusID:16803"
                     /db_xref="MGD:1098776"
                     /product="lipopolysaccharide binding protein"
                     /protein_id="NP_032515.1"
                     /db_xref="GI:6678670"
                     /translation="MKAGTGPLLSTLLGLLFLSIQGTGCVNPGVVARITDKGLAYAAK
                     EGLVALKRELYKITLPDFSGDFKIKAVGRGQYEFHSLEIQNCELRGSSLKLLPGQGLR
                     LAISDSSIGVRGKWKVRKSFLKLHGSFDLDVKGVTISVDLLLGMDPSGRPTVSASGCS
                     SRICDLDVHISGNVGWLLNLFHNQIESKLQKVLENKVCEMIQKSVTSDLQPYLQTLPV
                     TAEIDNVLGIDYSLVAAPQAKAQVLDVMFKGEIFNRNHRSPVATPTPTMSLPEDSKQM
                     VYFAISDHAFNIASRVYHQAGYLNFSITDDMLPHDSGIRLNTKAFRPFTPQIYKKYPD
                     MKLELLRTVVSAPILNVSPGNLSLAPQMEIEGFVILPTSAREPVFRLSVVTNVFASLT
                     FNTRKVTGMLHPDKAQVRLIESKVGIFNVNLFQAFLNYYLLNSLYPDVNAELAQGFPL
                     PLPRHIQLHDLDFQIRKDFLYLGANVQYMRV"
                     203..841
                     /note="LBP_BPI_CETP; Region: LBP / BPI / CETP family,
                     N-terminal domain"
                     224..895
                     /note="BPI1; Region: BPI/LBP/CETP N-terminal domain"
                     854..1561
                     /note="LBP_BPI_CETP_C; Region: LBP / BPI / CETP family,
                     N-terminal domain"
                     938..1549
                     /note="BPI2; Region: BPI/LBP/CETP C-terminal domain"
BASE COUNT 427 a 565 c 481 g 452 t 

ORIGIN 

1 ggctttcccg cttccaggag cccgccctcc ctcaagaacc gccccctagc cctgggcctt 
61 gactagttaa gaacaccaga gccagcccac tctgtcacag tcctcactcc cccacggagc 
121 atccaggatg aaggctggga ctggacctct gctctctaca ttgctggggc tgctgttcct 
181 gtccatacag ggaactggct gtgtcaaccc cggtgtggtg gccaggatca ccgacaaggg 
241 cctggcctac gcggccaagg aggggctggt ggctctgaag agagagctgt acaagatcac 
301 actaccggac ttcagcgggg acttcaagat caaggccgtg ggtcgtgggc agtacgagtt 
361 tcatagcctg gagattcaga actgtgagct gcgtggctcc tccctgaagc tgctcccagg 
421 ccagggcctg agactcgcca tctctgactc ttccatcggt gtccgaggca aatggaaggt 
481 gcgcaaatcc ttcctgaaac ttcatggctc ctttgacctg gatgtcaaag gtgtcaccat 
541 ttcagtggac ctcctcctgg gcatggatcc ctcagggcgg cccacagtct ctgcctctgg 
601 gtgcagcagc cgcatttgtg atttggatgt gcacatatca ggaaatgtgg ggtggctgct 
661 gaatctcttc cacaaccaga tcgagtcgaa gctccagaaa gtactggaga ataaggtttg 
721 tgagatgatc cagaagtctg tgacctctga cctgcagcct tatctccaga ctctgccagt 
781 cacagcagag atcgacaatg tcctgggcat tgactacagt ttggtggcgg ctccccaagc 
841 gaaggcccag gtgctggatg tcatgtttaa gggtgaaatt tttaatcgga atcaccgctc 
901 tccagttgct acccccactc cgaccatgag cctacctgag gacagtaagc aaatggtcta 
961 ctttgccatc tcagatcatg ccttcaacat agcaagccgg gtttatcacc aggctgggta 
1021 cctgaacttt tccatcacag atgacatgtt accacatgac tccggcatcc ggttgaacac 
1081 caaggccttc cgtcccttca ctccccagat atacaaaaag tatcccgaca tgaaattgga 
1141 gctccttaga acagtggtct ctgccccgat tctgaatgtc agtcctggga atctgtcctt 
1201 ggccccacag atggagatcg aaggctttgt gatcctgccc acctccgccc gggaacctgt 
1261 cttccggctt agcgtggtca ctaatgtatt tgcctcatta actttcaaca cacgcaaggt 
1321 taccgggatg ctgcatccag acaaggcaca agtgagactg atcgaatcca aagtcggcat 
1381 attcaatgtg aacctgttcc aggcattcct taactactac cttctcaaca gcctctaccc 
1441 tgatgtcaat gctgagctgg cccagggctt ccccctccct ctgccaaggc atattcagct 
1501 ccacgacctc gacttccaga tccgcaagga cttcttatac ttgggtgcca atgtccagta 
1561 tatgagagtc tgaggacaag aagaaagatg ggtctgagtg gcctcagctg gaccggccgt 
1621 ttgtaattcc agatgtacag cacatctctg gagagtctcc aaaatacaaa gatgtttctg 
1681 ctcctggctc ggggtgggcc ctgaccccag agtcctcttc agcaggtgca cccttgacct
1741 ggacttgact ccgtcccatc ctccaggaag ggcccctccc ctcactgact tctccaggaa
1801 gggccctctc caacactggg tgggagacct ggaatcactt taagagcggg cactgtggtt
1861 ttaaattcac catctgacct tcatgtttga cagggtgctg gcactttgta ggtcctcaat
1921 accgg
□ 1: X99347. M.musculus mRNA f [gi:1430866]
ProbeSet, Related Sequences, Protein, PubMed, Taxonomy, UniSTS, LinkOut

LOCUS       MMLPS        1925 bp    mRNA    linear   ROD 17-JUL-1997
DEFINITION  M.musculus mRNA for LPS-binding protein.
ACCESSION   X99347
VERSION     X99347.1  GI:1430866
KEYWORDS    LBP gene; LPS-binding protein.
SOURCE      house mouse.
  ORGANISM  Mus musculus
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Rodentia; Sciurognathi; Muridae; Murinae; Mus.
REFERENCE   1  (bases 1 to 1925)
  AUTHORS   Lengacher,S., Jongeneel,C.V., Le Roy,D., Lee,J.D., Kravchenko,V.,
            Ulevitch,R.J., Glauser,M.P. and Heumann,D.
  TITLE     Reactivity of murine and human recombinant LPS-binding protein
            (LBP) within LPS and gram negative bacteria
  JOURNAL   Journal of inflammation. 47 (4), 165-172 (1995)
  MEDLINE   97289150
   PUBMED   9144073
REFERENCE   2  (bases 1 to 1925)
  AUTHORS   Jongeneel,V.
  TITLE     Direct Submission
  JOURNAL   Submitted (15-JUL-1996) V. Jongeneel, Ludwig Institute for Cancer
            Research, Lausanne Branch, Chemin des Boveresses 155, CH-1066
            EPALINGES, Switzerland
FEATURES             Location/Qualifiers
     source          1..1925
                     /organism="Mus musculus"
                     /strain="BALB/c"
                     /db_xref="taxon:10090"
                     /cell_type="hepatocyte"
                     /dev_stage="adult"
     gene            128..1573
                     /gene="LBP"
     CDS             128..1573
                     /gene="LBP"
                     /codon_start=1
                     /product="LPS-binding protein"
                     /protein_id="CAA67727.1"
                     /db_xref="GI:1430867"
                     /db_xref="SWISS-PROT:Q61805"
                     /translation="MKAGTGPLLSTLLGLLFLSIQGTGCVNPGVVARITDKGLAYAAK
                     EGLVALKRELYKITLPDFSGDFKIKAVGRGQYEFHSLEIQNCELRGSSLKLLPGQGLR
                     LAISDSSIGVRGKWKVRKSFLKLHGSFDLDVKGVTISVDLLLGMDPSGRPTVSASGCS
                     SRICDLDVHISGNVGWLLNLFHNQIESKLQKVLENKVCEMIQKSVTSDLQPYLQTLPV
                     TAEIDNVLGIDYSLVAAPQAKAQVLDVMFKGEIFNRNHRSPVATPTPTMSLPEDSKQM
                     VYFAISDHAFNIASRVYHQAGYLNFSITDDMLPHDSGIRLNTKAFRPFTPQIYKKYPD
                     MKLELLRTVVSAPILNVSPGNLSLAPQMEIEGFVILPTSAREPVFRLSVVTNVFASLT
                     FNTRKVTGMLHPDKAQVRLIESKVGIFNVNLFQAFLNYYLLNSLYPDVNAELAQGFPL
                     PLPRHIQLHDLDFQIRKDFLYLGANVQYMRV"
BASE COUNT 427 a 565 c 481 g 452 t 

ORIGIN 

1 ggctttcccg cttccaggag cccgccctcc ctcaagaacc gccccctagc cctgggcctt
61 gactagttaa gaacaccaga gccagcccac tctgtcacag tcctcactcc cccacggagc 
121 atccaggatg aaggctggga ctggacctct gctctctaca ttgctggggc tgctgttcct 
181 gtccatacag ggaactggct gtgtcaaccc cggtgtggtg gccaggatca ccgacaaggg 
241 cctggcctac gcggccaagg aggggctggt ggctctgaag agagagctgt acaagatcac 
301 actaccggac ttcagcgggg acttcaagat caaggccgtg ggtcgtgggc agtacgagtt 
361 tcatagcctg gagattcaga actgtgagct gcgtggctcc tccctgaagc tgctcccagg 
421 ccagggcctg agactcgcca tctctgactc ttccatcggt gtccgaggca aatggaaggt 
481 gcgcaaatcc ttcctgaaac ttcatggctc ctttgacctg gatgtcaaag gtgtcaccat 
541 ttcagtggac ctcctcctgg gcatggatcc ctcagggcgg cccacagtct ctgcctctgg 
601 gtgcagcagc cgcatttgtg atttggatgt gcacatatca ggaaatgtgg ggtggctgct 
661 gaatctcttc cacaaccaga tcgagtcgaa gctccagaaa gtactggaga ataaggtttg 
721 tgagatgatc cagaagtctg tgacctctga cctgcagcct tatctccaga ctctgccagt 
781 cacagcagag atcgacaatg tcctgggcat tgactacagt ttggtggcgg ctccccaagc 
841 gaaggcccag gtgctggatg tcatgtttaa gggtgaaatt tttaatcgga atcaccgctc 
901 tccagttgct acccccactc cgaccatgag cctacctgag gacagtaagc aaatggtcta 
961 ctttgccatc tcagatcatg ccttcaacat agcaagccgg gtttatcacc aggctgggta 
1021 cctgaacttt tccatcacag atgacatgtt accacatgac tccggcatcc ggttgaacac 
1081 caaggccttc cgtcccttca ctccccagat atacaaaaag tatcccgaca tgaaattgga 
1141 gctccttaga acagtggtct ctgccccgat tctgaatgtc agtcctggga atctgtcctt 
1201 ggccccacag atggagatcg aaggctttgt gatcctgccc acctccgccc gggaacctgt 
1261 cttccggctt agcgtggtca ctaatgtatt tgcctcatta actttcaaca cacgcaaggt 
1321 taccgggatg ctgcatccag acaaggcaca agtgagactg atcgaatcca aagtcggcat 
1381 attcaatgtg aacctgttcc aggcattcct taactactac cttctcaaca gcctctaccc 
1441 tgatgtcaat gctgagctgg cccagggctt ccccctccct ctgccaaggc atattcagct 
1501 ccacgacctc gacttccaga tccgcaagga cttcttatac ttgggtgcca atgtccagta 
1561 tatgagagtc tgaggacaag aagaaagatg ggtctgagtg gcctcagctg gaccggccgt 
1621 ttgtaattcc agatgtacag cacatctctg gagagtctcc aaaatacaaa gatgtttctg 
1681 ctcctggctc ggggtgggcc ctgaccccag agtcctcttc agcaggtgca cccttgacct 
1741 ggacttgact ccgtcccatc ctccaggaag ggcccctccc ctcactgact tctccaggaa 
1801 gggccctctc caacactggg tgggagacct ggaatcactt taagagcggg cactgtggtt 
1861 ttaaattcac catctgacct tcatgtttga cagggtgctg gcactttgta ggtcctcaat 
1921 accgg 

// 
□ 1: U43202. RFP provirus of M...[gi:13940446]
Related Sequences, Protein, PubMed, Taxonomy

LOCUS       MLU43202     8892 bp    DNA     linear   VRL 04-MAY-2001
DEFINITION  RFP provirus of Murine leukemia virus gag precursor protein, pol
            precursor protein, env precursor protein genes, complete cds.
ACCESSION   U43202
VERSION     U43202.1  GI:13940446
KEYWORDS    .
SOURCE      Murine leukemia virus.
  ORGANISM  Murine leukemia virus
            Viruses; Retroid viruses; Retroviridae; Gammaretrovirus.
REFERENCE   1  (bases 1 to 8892)
  AUTHORS   Pedersen,L., Behnisch,W., Schmidt,J., Luz,A., Pedersen,F.S.,
            Erfle,V. and Strauss,P.G.
  TITLE     Molecular cloning of osteoma-inducing replication-competent murine
            leukemia viruses from the RFB osteoma virus stock
  JOURNAL   J. Virol. 66 (10), 6186-6190 (1992)
  MEDLINE   92408015
   PUBMED   1326664
REFERENCE   2  (bases 1 to 8892)
  AUTHORS   Gimbel,W., Schmidt,J., Brack-Werner,R., Luz,A., Strauss,P.G.,
            Erfle,V. and Werner,T.
  TITLE     Molecular and pathogenic characterization of the RFB osteoma virus:
            lack of oncogene and induction of osteoma, osteopetrosis, and
            lymphoma
  JOURNAL   Virology 224 (2), 533-538 (1996)
  MEDLINE   97028500
   PUBMED   8874513
REFERENCE   3  (bases 1 to 8892)
  AUTHORS   Gimbel,W.
  TITLE     Aufklaerung der Primaerstruktur des Provirus des murinen RFB
            Leukaemie virus
  JOURNAL   Thesis (1993) Institut fuer Molekulare Virologie,
            GSF-Forschungszentrum fuer Umwelt und Gesundheit, Neuherberg
REFERENCE   4  (bases 1 to 8892)
  AUTHORS   Gimbel,W.
  TITLE     Direct Submission
  JOURNAL   Submitted (13-DEC-1995) Wolfgang Gimbel, Institut fuer Molekulare
            Virologie, GSF-Forschungszentrum fuer Umwelt und Gesundheit,
            Neuherberg, Ingolstaedter Landstrasse 1, Oberschleissheim D-85764,
            Germany
FEATURES             Location/Qualifiers
     source          1..8892
                     /organism="Murine leukemia virus"
                     /proviral
                     /db_xref="taxon:11786"
                     /note="RFB provirus"
     LTR             1..605
                     /note="5'LTR"
     repeat_region
                     /note="5' inverted repeat of 5'LTR"
     LTR             1..463
                     /note="U3 region of 5'LTR"
     protein_bind    31..36
                     /bound_moiety="Myb vertebrate RNA polymerase II
                     transcription factor"
     protein_bind    40..50
                     /bound_moiety="LBP"
     protein_bind    48..56
                     /note="LAP, IL-6DBP, H-APF-2, AGP/EBP preliminary pattern"
                     /bound_moiety="NF-IL6"
     protein_bind    102..107
                     /bound_moiety="PEA3"
     protein_bind    146..152
                     /note="E12, pan1, E47, pan2"
                     /bound_moiety="E2A"
                     164..227
                     /note="1. large LTR direct repeat"
     protein_bind    171..181
                     /note="S-CBF preliminary pattern"
                     /bound_moiety="SEF-1"
     protein_bind    173..180
                     /bound_moiety="AP3"
                     173..182
                     /note="enhancer core A"
     protein_bind    191..198
                     /bound_moiety="AP2"
     protein_bind    193..199
                     /bound_moiety="GCF"
     protein_bind    194..207
                     /note="CTF preliminary pattern"
                     /bound_moiety="NF1"
     protein_bind    210..216
                     /note="E12, pan1, E47, pan2"
                     /bound_moiety="E2A"
                     228..291
                     /note="2. large LTR direct repeat"
     protein_bind    235..245
                     /note="S-CBF preliminary pattern"
                     /bound_moiety="SEF-1"
                     237..246
                     /note="enhancer core B"
     protein_bind    237..244
                     /bound_moiety="AP3"
     protein_bind    257..263
                     /bound_moiety="GCF"
     protein_bind    258..271
                     /note="CTF preliminary pattern"
                     /bound_moiety="NF1"
     protein_bind    274..280
                     /note="E12, pan1, E47, pan2"
                     /bound_moiety="E2A"
     CAAT_signal     383..387
                     /note="CAAT box of 5'LTR"
     TATA_signal     434..438
                     /note="TATAA box of 5'LTR"
     protein_bind    434..439
                     /bound_moiety="TBP"
     protein_bind    462..468
                     /note="preliminary pattern; core of Behnisch"
                     /bound_moiety="H-APF-1"
     protein_bind    462..468
                     /bound_moiety="GCF"
     LTR             464..531
                     /note="R region of 5'LTR"
     prim_transcript 464..8818
                     /note="retroviral RNA"
     polyA_signal    510..515
                     /note="poly-A signal of 5'LTR"
     protein_bind    526..533
                     /bound_moiety="NF-W1"
     protein_bind    526..533
                     /bound_moiety="NF-W2"
     LTR             532..605
                     /note="U5 region of 5'LTR"
     repeat_region   595..605
                     /note="3' inverted repeat of 5'LTR"
     misc_feature    606..1099
                     /note="leader 5' untranslated region"
     misc_binding    608..625
                     /bound_moiety="tRNA"
     misc_signal     662..667
                     /note="splice donor site"
     gene            1100..2713
                     /gene="gag"
     CDS             1100..2713
                     /gene="gag"
                     /codon_start=1
                     /product="gag precursor protein"
                     /protein_id="AAK50380.1"
                     /db_xref="GI:13940447"
                     /translation="MGQTVTTPLSLTLKHWEDVQRIASNQSVDVKKRRWVTFCSAEWP
                     TFGVGWPQDGTFNLDIILQVKSKVFSPGPHGHPDQVPYIVTWEAIAYEPPPWVKPFVS
                     PKLSPSPTAPVLPSGPSTQPPPRSALYPALTPSIKPRPSKPQVLSDNGGPLIDLLSED
                     PPPYGGQGLSSSDGDGDREEATSTSEIPAPSPIVSRLRGKRDPPAADSTTSRAFPLRL
                     GGNGQLQYWPFSSSDLYNWKNNNPSFSEDPGKLTALIESVLTTHQPTWDDCQQLLGTL
                     LTGEEKQRVLLQARKAVRGNDGRPTQLPNEVDAAFPLERPDWDYTTQRGRNNLVLYRQ
                     LLLAGLQNVGRSPTNLAKVKGITQGPNESPSAFLERLKEAYRRYTPYDPEDPGQETNV
                     SMSFIWQSAPDIGRKLERLEDLKSKTLGDLVREAERFFNKRETPEEREERVRRETEEK
                     EERRRAEEEQKEKERDRKRHREMSKLLATVVSGQRQDRQGGERRRPQLDKDQCAYCKE
                     KGHWAKDCPKKPRGPRGPRPQTSLLTLDD"
     stem_loop       2693..2731
     gene            2714..6304
                     /gene="pol"
     CDS             <2714..6304
                     /gene="pol"
                     /note="pol precursor protein"
                     /codon_start=1
                     /protein_id="AAK50381.1"
                     /db_xref="GI:13940448"
                     /translation="GGQGQEPPPEPRITLTVGGQPVTFLVDTGAQHSVLTQNPGPLSD
                     RSAWVQGATGGKRYRWTTDRKVHLATGKVTHSFLHVPDCPYPLLGRDLLTKLKVQIHF
                     EGSGAQVVGPKGQPLQVLTLNLEDEYRLYETSAEPEASPGSTWLSDFPQAWAETGGMG
                     LAVRQAPLIIPLKATSTPVSIKQYPMSQEARLGIKPHIQRLLDQGILVPCQSPWNTPL
                     LPVKKPGTNDYRPVQDLREVNKRVEDIHPTVPNPYNLLSGLPPSHRWYTVLDLKDAFF
                     CLRLHPTSQPLFAFEWRDPGMGISGQLTWTRLPQGFKNSPTLFDEALHRDLAGFRIQH
                     PDLILLQYVDDLLLAATSELDCQQGTRALLQTLGDLGYRASAKKAQICQKQVKYLGYL
                     LKEGQRWLTEARKETVMGQPIPKTPRQLREFLGTAGFCRLWIPGFAEMAAPLYPLTKT
                     GTLFNWGPDQQKAYQEIKQALLTAPALGLPDLTKPFELFVDEKQGYAKGVLTQKLGPW
                     RRSVAYLSKKLDPVAAGWPPCLRMVAAIAVLTKDAGKLTMGQPLVILAPHAEEALVKQ
                     PPDRWLSNARMTHYQAMLLDTDRVQFGPVVALNPATLLPLPEEGAPHDCLEILAETHG
                     TRPDLTDQPIPDADHTWYSDGSSFLQEGQRKAGAAVTTETEVIWARALPAGTSAQRAE
                     LIALTQALKMAEGKRLNVYTDSRYAFATAHIHGEIYRRRGLLTSEGREIKNKSEILAL
                     LKALFLPKRLSIIHCLGHQKGDSAEARGNRLADQAAREAAIKTPPDTSTLLIEDSTPY
                     TPAYFHYTETDLKKLRELGATYNQSKGYWVFQGKPVMPDQFVFELLDSLHRLTHLGYQ
                     KMKALLDRGESPYYMLNRDKTLQYVADSCTVCAQVNASKAKIGAGVRVRGHRPGSHWE
                     IDFTEVKPGLYGYKYLLVFVDTFSGWVEAFPTKRETARVVSKKLLEEIFPRFGMPQVL
                     GSDNGPAFTSQVSQSVADLLGIDWKLHCAYRPQSSGQVERMNRTIKETLTKLTLAAGT
                     RDWVLLLPLALYRARNTPGPHGLTPYEILYGAPPPLVNFHDPDMSELTNSPSLQAHLQ
                     ALQTVQREIWKPLAEAYRDQLDQPVIPHPFRIGDSVWVRRHQTKNLEPRWKGPYTVLL
                     TTPTALKVDGISAWIHAAHVKAATTPPIKPSWRVQRSQNPLKIRLTRGAP"
     stem_loop       2722..2746
                     /gene="pol"
     misc_feature    5967..5979
                     /gene="pol"
                     /note="splice acceptor site"
     gene            6244..8238
                     /gene="env"
     CDS             6244..8238
                     /gene="env"
                     /note="env precursor protein"
                     /codon_start=1
                     /protein_id="AAK50382.1"
                     /db_xref="GI:13940449"
                     /translation="MESTTLSKPFKNQVNPWGPLIVLLILGGVNPVTLENSPHQVFNL
                     TWEVTNGDRETVWAITGNHPLWTWWPDLTPDLCMLALHGPSYWGLEYRAPFSPPPGPP
                     CCSGSSDSTPGCSRDCEEPLTSYTPRCNTAWNRLKLSKVTHAHNGGFYVCPGPHRPRW
                     ARSCGGPESFYCASWGCETTGRASWKPSSSWDYITVSNNLTSDQATPVCKGNEWCNSL
                     TIRFTSFGKQATSWVTGHWWGLRLYVSGHDPGLIFGIRLKITDSGPRVPIGPNPVLSD
                     RRPPSRPRPTRSPPPSNSTPTETPLTLPEPPPAGVENRLLNLVKGAYQALNLTSPDKT
                     QECWLCLVSGPPYYEGVGVLGTYSNHTSAPGNCSVASQHKLTLSEVTGQGLCVGAVPK
                     THQALCNTTQKTSDGSYYLAAPAGTIWACNTGLTPCLSTTVLDLTTDYCVLVELWPRV
                     TYHSPSYVYHQFERRAKYKREPVSLTLALLLGGLTMGGIAAGVGTGTTALVATQQFQQ
                     LQAAMHDDLKEVEKSITNLEKSLTSLSEVVLQNRRGLDLLFLKEGGLCAALKEECCFY
                     ADHTGLVRDSMAKLRERLSQRQKLFESQQGWFEGLFNKSPWFTTLISTIMGPLIILLL
                     ILLFGPCILNRLVQFIKDRISVVQALVLTQQYHQLKTIEN"
     mat_peptide     7654..8235
                     /gene="env"
                     /product="p15E"



     misc_feature    8273..8285
                     /note="poly purine track"
     LTR             8288..8892
                     /note="3'LTR"
     LTR             8288..8750
                     /note="U3 region of 3'LTR"
     repeat_region   8288..8298
                     /note="5' inverted repeat of 3'LTR"
     CAAT_signal     8670..8674
                     /note="CAAT box of 3'LTR"
     TATA_signal     8721..8725
                     /note="TATAA box of 3'LTR"
     LTR             8751..8818
                     /note="R region of 3'LTR"
     polyA_signal    8797..8802
                     /note="poly-A signal of 3'LTR"
     LTR             8819..8892
                     /note="U5 region of 3'LTR"
     repeat_region   8882..8892
                     /note="3' inverted repeat of 3'LTR"
BASE COUNT 2285 a 2550 c 2183 g 1874 t

ORIGIN 

1 tgaaagaccc cttcataagg cttagccagc taactgcagt aacgccattt tgcaaggcat 
61 ggaaaaatac cagagctgat gttctcagaa aaacaagaac aaggaagtac agagaggctg 
121 gaaagtaccg ggactagggc caagaacaga tggttcccag aaacaggata tctgtggtta 
181 agcactaggg ccccggccca gggccaagaa cagatggttc ccagaaacag gatatctgtg 
241 gttaagcact agggccccgg cccagggcca agaacagatg gttcccagaa atagctaaaa 
301 caacaacagt ttcaagagac ccagaaactg tctcaaggtt cccagatgac cggggatcaa 
361 ccccaagcct catttaaact aaccaatcag ctcgcttctc gcttctgtac ccgcgcttat 
421 tgctgcccag ctctataaaa agggtaagaa ccccacactc ggcgcgccag tcctccgata 
481 gactgagtcg cccgggtacc cgtgtatcca ataaagcctt ttgctgttgc atccgaatcg 
541 tggtctcgct gatccttggg agggtctcct cagagtgatt gactgcccag cctgggggtc 
601 tttcatttgg gggctcgtcc gggatttgga gacccccgcc cagggaccac cgacccaccg 
661 tcgggaggta agctggccag cgatcgtttt gtctccgtct ctgtctttgt gcgtgtgtgt 
721 gtgtgtgccg gcatctactt tttgcgcctg cgtctgattc tgtactagtt agctaactag 
781 atctgtatct ggcggctccg tggaagaact gacgagttcg tattcccgac cgcagccctg 
841 ggagacgtct cagaggcatc gggggcccgc tgggtggccc aatcagtaag tccgagtcct 
901 gaccgattcg gactatttgg ggcccctcct ttgtcggagg ggtacgtggt tcttttagga 
961 gacgagaggt ccaagccctc gccgcctcca tctgaatttt gcttttcggt ttttcgccga 
1021 aaccgcagcg aatcttgtct gtctcagtat tgttttgtca tttgtctgtt cgttattgtt 
1081 ttggaccgct tctaaaaaca tgggacagac cgtaaccacc cctctgagtc tgaccctaaa 
1141 acactgggaa gatgtccagc gcatcgcgtc caatcagtcc gtagatgtca agaagagacg 
1201 ctgggtcacc ttctgctctg ccgagtggcc aactttcggt gtagggtggc cacaagatgg 
1261 tacttttaat ttggacatta ttctacaggt taaatctaag gtgttctctc ctggtcccca 
1321 cggacacccg gatcaggtcc catatattgt cacctgggag gctattgcct atgaaccccc 
1381 tccgtgggtc aaaccttttg tctctcccaa actctccccc tctccaaccg ctcccgtcct 
1441 cccatccggt ccttcgaccc aacctccgcc ccgatctgcc ctttaccctg ctcttacccc 
1501 ctctataaaa cccagacctt ctaaacctca ggttctctcc gataatggcg gacctctcat 
1561 tgaccttctc tcagaagacc ctccgccgta cggaggacag ggactgtcct cctctgacgg 
1621 agatggcgac agagaagagg ccacctccac ttctgagatt cctgccccct ctcccatagt 
1681 gtctcgcctg cggggcaaaa gagacccccc cgcggcagat tccaccacct ctcgggcttt 
1741 cccactccgt ttggggggta atggtcagtt gcagtactgg ccgttttcct cctctgatct 
1801 atataactgg aaaaataata atccttcctt ctctgaggat ccaggtaaac tgactgcatt 
1861 gattgaatcc gtcctcacca cccaccagcc cacctgggat gattgccagc aattattagg 
1921 gactctgctt accggggagg agaagcagcg ggtgctcctc caagcccgaa aggctgtccg 
1981 gggcaacgat gggcgcccca cccaactgcc caacgaggtt gacgctgctt ttccccttga 
2041 acgtcccgat tgggattaca ccacccaaag aggtaggaac aacctagttc tctatcgcca 
2101 gttgctttta gcaggtctcc aaaacgtggg ccgaagcccc accaatttgg ccaaggtaaa 
2161 aggaataacc cagggaccta atgagtcccc ctcagccttc ctagagagac tcaaggaagc 
2221 ctatcgcaga tacactcctt atgatcctga ggaccctggg caagaaacga atgtatctat 
2281 gtcattcatc tggcagtccg ctccagacat tggtcgaaag ttagagcggt tagaagactt 
2341 aaaaagtaaa actttaggag atttagtgag agaagccgaa aggttcttta ataaaagaga 
2401 aactccagaa gaaagagaag agcgtgttag aagggagacc gaggaaaagg aagagcggcg 
2461 tagggcagaa gaggagcaga aagagaaaga gagggaccgt aagagacata gagaaatgag 
2521 caaacttttg gccaccgtag ttagtggaca gagacaggat agacaggggg gagagcgaag 
2581 gaggccccaa cttgacaagg atcaatgtgc ctactgtaaa gaaaaaggac actgggctaa 
2641 agactgccca aagaagccac ggggtccccg aggaccgcga ccccagacct ccctcctgac 
2701 tttagacgac taggggggtc agggtcagga gcccccccct gaacccagga taaccctcac 
2761 tgtcgggggg caacccgtca ccttcctggt ggatactggg gcccaacact ccgtgctgac 
2821 tcaaaatcct gggcccctaa gtgacaggtc ggcttgggtc caaggggcta ctggaggaaa 
2881 gcggtatcgc tggaccacag atcgcaaggt acacctagct accggtaagg tcactcactc 
2941 tttcctccat gtgccagact gcccctaccc cttgctagga agagacttgt taactaaact 
3001 aaaagtccag atccactttg agggatcagg agcccaggtt gtgggaccaa aaggacagcc 
3061 tctacaggta ttgaccctaa acctagagga tgagtatcgg ctttatgaga cctcagcaga 
3121 gccggaagct tctcctgggt ccacctggct ctccgacttt ccccaggcct gggcagaaac 
3181 cgggggcatg ggactggcag ttcgccaggc gcctctaatc atacctctga aggcaacctc 
3241 cacccctgtg tccataaaac aataccccat gtcacaggaa gccagactgg ggatcaagcc 
3301 ccacatacag aggctgttgg accagggaat actggtaccc tgccagtccc cctggaatac 
3361 acccctgcta cccgttaaga aaccaggaac taacgattat aggcctgtcc aggatctgag 
3421 agaagtcaac aagcgggtgg aagacatcca ccccaccgtg cccaatcctt ataacctcct 
3481 aagtgggctc ccaccgtccc accggtggta tactgtgctt gatttaaagg atgccttttt

3541 ctgcctgaga ctccacccca ccagtcagcc tctcttcgcc ttcgagtgga gagatccagg 

3601 gatgggaatc tcaggacaat taacctggac cagactccca cagggtttca aaaacagtcc 

3661 caccctgttt gatgaggcac tgcacagaga cctagcaggc ttccggatcc agcacccaga 

3721 cttgatcctg ctacagtacg tggatgactt actgctggcc gccacttctg agctcgactg 

3781 ccaacaaggt actcgggccc tgttacaaac cctaggggac ctcgggtatc gggcctcggc 

3841 caagaaagcc caaatttgcc agaaacaggt caagtatctg gggtatcttc taaaagaggg 

3901 tcagagatgg ctgactgagg ccagaaaaga gactgtgatg gggcagccta ttccgaagac 

3961 ccctcgacaa ctaagggagt tcctagggac ggcaggcttc tgtcgcctct ggatccctgg 

4021 gtttgcagaa atggcagccc ccttgtaccc tctcaccaaa acggggactc tgtttaattg 

4081 gggcccagac cagcaaaagg cctatcaaga aatcaaacag gctcttctaa ctgccccagc 

4141 cctggggttg ccagatttga ctaagccctt tgaactcttt gtcgacgaga agcagggcta 

4201 cgccaaaggc gtcctaacgc aaaaactggg accttggcgt cggtcggtgg cctacctgtc 

4261 caaaaagcta gacccagtgg cagctgggtg gcccccttgc ctacggatgg tagcagccat 

4321 tgccgttctg acaaaggatg caggcaagct aactatggga cagccgctag tcattctggc 

4381 cccccatgcg gaagaagcac tggtcaaaca accccctgac cgctggctat ccaacgcccg 

4441 catgacccac taccaggcaa tgctcctaga cactgaccga gttcagttcg gaccagtggt 

4501 ggccctcaat cctgccacct tactccctct cccggaagaa ggagcccccc atgattgcct 

4561 cgagatcttg gctgaaacgc atggaaccag accggatctc accgaccagc ccatcccaga 

4621 cgccgaccac acctggtatt ccgatgggag cagctttttg caagaaggac agcgaaaggc

4681 tggggcagcg gtgaccactg agaccgaggt aatctgggca agggcactgc cggctggaac

4741 atctgcccaa cgggccgaac tgatagcact cactcaagcc ttgaagatgg cagaaggtaa 

4801 gaggctaaac gtttacactg acagccgata tgctttcgcc acggcccata tccatggaga 

4861 gatctatagg aggcgagggt tgctaacctc agagggtaga gaaatcaaaa acaagagcga

4921 gatcctggct ttactaaaag ctctttttct gccaaagaga ctcagtataa tccactgcct

4981 ggggcatcaa aaaggagata gtgccgaggc taggggcaac cgcctagcag accaagcggc

5041 ccgggaggca gccataaaga cgcctccaga tacatccact ctccttatag aggattcaac 

5101 cccatatacg cctgcctatt tccattatac tgaaacagat ctaaagaaac tcagagagct 

5161 tggggccacc tataaccaga gcaaaggata ctgggtcttc caaggcaagc cggtgatgcc 

5221 cgaccaattt gtatttgaac tgttagactc actccaccgg ctcacccacc tcggctacca 

5281 gaaaatgaaa gcactccttg acagaggaga aagcccctac tacatgctaa accgggacaa 

5341 aaccctccaa tatgtggcag attcctgcac ggtctgtgcc caagtaaatg ccagcaaagc 

5401 taaaatcggg gcaggagtga gagtacgagg acatcggcca ggctcccatt gggagatcga 

5461 ttttacagaa gtcaagccag ggctgtatgg gtacaagtac ctcctggtat tcgtggacac 

5521 cttctctggt tgggtggaag cctttccaac caagagagaa acagcaagag tcgtgtccaa 

5581 gaaattgctg gaagaaatat tcccgagatt cggaatgcca caggtattgg gatctgataa 

5641 cgggcctgcc ttcacctccc aggtaagtca gtcggtggcc gatttactgg ggatcgattg 

5701 gaaattacat tgtgcttata gaccccagag ttcaggtcag gtagaaagaa tgaatagaac 

5761 catcaaggag actctaacta aattaacgct tgcagctggc actagagact gggtactcct 

5821 actcccctta gctctttacc gagcccggaa cactccgggc ccccatggac tgactccgta 

5881 tgaaatcttg tacggggcgc ccccgcccct tgttaacttc catgaccccg acatgtcaga 

5941 attaactaat agcccatctc tccaagctca cttacaggcc ctccaaacgg tgcagcgaga 

6001 aatttggaaa ccactggccg aggcctaccg ggaccaacta gaccaaccag tgataccaca 

6061 ccccttccgg attggagact ccgtgtgggt gcgccggcac cagaccaaaa acttagaacc 

6121 tcgctggaag ggaccctaca ccgtcctact gaccaccccc accgctctca aggtagacgg 

6181 catctctgca tggatacacg ccgcccacgt caaggcagcg accacgcccc cgataaaacc 

6241 atcatggaga gtacaacgct ctcaaaaccc tttaaaaatc aggttaaccc gtggggcccc 

6301 ctaattgtcc ttctgattct cggaggggtc aaccccgtta cgttggaaaa cagcccccac 

6361 caggttttta acctcacctg ggaagtgact aatggagacc gagaaacggt gtgggcaata 

6421 accggcaatc accctctgtg gacttggtgg cctgacctca caccagatct ctgtatgttg 

6481 gccctccacg ggccgtccta ttggggccta gaatatcggg ctcctttttc tcctcccccg 

6541 gggcccccct gctgttcagg aagcagcgac tccacgccag gctgttccag agattgtgag 

6601 gagcccctga cttcatatac tccccggtgc aatacggcct ggaacagact taagttatct 

6661 aaagtgacac atgcacacaa tggaggattc tatgtctgcc ccgggccaca tcgcccccgg 

6721 tgggcccggt catgtggtgg tccagaatcc ttctattgtg cctcttgggg ctgcgaaacc 

6781 acaggccgag catcctggaa accatcctcg tcttgggact acatcacagt aagcaacaat 

6841 ctaacctcag accaggcaac cccagtatgc aaaggtaatg agtggtgcaa ctccttaact 

6901 atccggttca cgagctttgg aaaacaggcc acctcctggg tcacaggcca ttggtgggga 

6961 ttgcgcctat acgtctctgg acatgaccca gggctcatct ttgggatccg acttaaaatt 

7021 acagactcgg ggccccgggt cccaataggg ccaaaccccg tcttgtcaga ccggcgacca 
7081 ccttcccggc ctagacccac cagatctccc ccgccttcaa actccacccc aaccgagaca
7141 cccctcaccc tccccgaacc cccgccagcg ggagtcgaaa accgattgtt aaatctagta
7201 aaaggagcct accaagccct caacctcacc agtcctgata aaacccaaga gtgctggtta
7261 tgccttgtat cgggaccccc atactacgaa ggggttggcg tcctaggtac ttattccaac
7321 catacctctg ccccaggtaa ctgttccgtg gcctcccaac acaagctgac cctgtccgaa
7381 gtgaccggac agggactctg cgtaggagca gttcccaaaa cccatcaggc cctgtgtaat
7441 accacccaga agacgagcga cgggtcctac tatctggctg ctcccgccgg gaccatttgg
7501 gcttgcaaca ccgggctcac tccctgccta tctaccactg tactcgacct caccaccgat
7561 tactgtgtcc tggtcgagct ttggccaagg gtgacctacc attcccctag ttatgtttac
7621 caccaatttg aaagacgagc caaatataaa agagaacccg tctcactaac tctggcccta
7681 ctattaggag gactcactat gggcggaatt gccgctggag tgggaacagg gactaccgcc
7741 ctagtggcca ctcagcagtt ccaacaactc caggctgcca tgcacgatga ccttaaagaa
7801 gttgaaaagt ccatcactaa tctagaaaaa tctttgacct ccttgtccga agtagtgtta
7861 cagaatcgta gaggcctaga tctactattc ctaaaagagg gaggtttgtg tgctgcctta
7921 aaagaagaat gctgtttcta tgccgaccac acaggattag tacgggatag catggccaaa
7981 cttagagaaa gattgagtca gagacaaaag ctctttgaat cccaacaagg gtggtttgaa
8041 gggctgttta ataagtcccc ttggttcacc accctgatat ccaccatcat gggtcccctg
8101 ataatcctct tgttaatttt actctttggg ccttgtattc tcaatcgcct ggtccagttt
8161 atcaaagaca ggatttcggt agtgcaggcc ctggttctga ctcaacaata tcatcaactt
8221 aagacaatag aaaattaaaa atcacgtgaa taaaagattt tattcagttt acagaaagag
8281 gggggaatga aagacccctt cataaggctt agccagctaa ctgcagtaac gccattttgc
8341 aaggcatgga aaaataccag agctgatgtt ctcagaaaaa caagaacaag gaagtacaga
8401 gaggctggaa agtaccggga ctagggccaa gaacagatgg ttcccagaaa caggatatct
8461 gtggttaagc actagggccc cggcccaggg ccaagaacag atggttccca gaaacaggat
8521 atctgtggtt aagcactagg gccccggccc agggccaaga acagatggtt cccagaaata
8581 gctaaaacaa caacagtttc aagagaccca gaaactgtct caaggttccc agatgaccgg
8641 ggatcaaccc caagcctcat ttaaactaac caatcagctc gcttctcgct tctgtacccg
8701 cgcttattgc tgcccagctc tataaaaagg gtaagaaccc cacactcggc gcgccagtcc
8761 tccgatagac tgagtcgccc gggtacccgt gtatccaata aagccttttg ctgttgcatc
8821 cgaatcgtgg tctcgctgat ccttgggagg gtctcctcag agtgattgac tgcccagcct
8881 gggggtcttt ca